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Abstract 

In the past 20 years, synthetic combinatorial methods have fundamentally advanced the ability to 
synthesize and screen large numbers of compounds for drug discovery and basic research. 
Mixture-based libraries and positional scanning deconvolution combine two approaches for the 
rapid identification of specific scaffolds and active ligands. Here we present a quantitative 
assessment of the screening of 32 positional scanning libraries in the identification of highly 
specific and selective ligands for two formylpeptide receptors. We also compare and contrast two 
mixture-based library approaches using a mathematical model to facilitate the selection of active 
scaffolds and libraries to be pursued for further evaluation. The flexibility demonstrated in the 
differently formatted mixture-based libraries allows for their screening in a wide range of assays. 
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1. Introduction 

Mixture-based combinatorial libraries, reviewed in [1-3], are an efficient and effective way 
to explore large, dense areas of the chemical space in an exponentially smaller number of 
samples In a positional scanning mixture-based combinatorial library, mixtures are 
systematically arranged and tested in order to determine those most likely to contain active 
compounds [4,5]. These data are then used to deconvolute the library by making the 
individual compounds from the functionalities of the most active mixtures. Recent advances 
in the numerical modeling of mixture-based combinatorial libraries [6] has led to a greater 
understanding of how the Harmonic Mean model, in conjunction with the presence of 
multiple structural analogs within each mixture, leads to the differentiation of mixtures 
containing active compounds from those that do not. Such models have also led to 
impressive estimates of the robustness of a mixture's activity to variations in the 
equimolarity of that mixture's constituent compounds [7]. Over the last 20 years the number 
of new positional scanning libraries, including scaffolds comprised of peptides, 
peptidomimetics, heterocycles, and other classes of small molecules, has increased and the 
total number of samples available for testing is in the thousands. 

In an effort to further increase efficiency and utility as this collection of libraries increases, 
we previously developed a strategy termed scaffold ranking for the rapid identification and 
ranking of active library scaffolds [3]. Figure 1 shows a simplified illustration of the 
screening process using mixture-based combinatorial libraries. In a scaffold ranking library, 
all compounds in the library are simultaneously present as a mixture in a single sample; 
Figure 1(A) shows two 27-compound scaffold ranking library samples, with the colors red, 
blue and yellow representing three choices of functionality at each of three positions. In 
general, scaffold ranking library samples can result from mixing the cleaved products of the 
complete positional scanning library or may be synthesized directly as a single mixture. The 
objective of using scaffold ranking Ubraries is to prioritize library scaffolds for future 
analysis, including positional scanning; as shown in Figure 1(A,B), the scaffold which 
includes the black active compound (represented by a triangle) is chosen for positional 
scanning because its scaffold ranking mixture is relatively more active when compared to 
the other scaffold shown (represented by a circle). Only a single positional scanning library 
is then tested [Figure 1(B)] and deconvoluted (by picking the most active mixtures at each 
position) in order to find the active compound [Figure 1(C)]. This process can be 
advantageous in low-throughput assays that would make numerous positional scanning 
library screenings impractical. The relative efficacy of the scaffold ranking approach 
provides clear support for its use in low-throughput assays, or costly assays including in 
vivo screening. The format in which scaffold ranking or positional scanning libraries are 
used in a particular lead discovery effort will depend on the resources and throughput of the 
assay. The flexibility of these two screening formats of mixture-based libraries represents a 
clear advantage for the rapid identification of active lead compounds. 

The use of both positional scanning libraries and scaffold ranking have been previously 
reported [3,8-1 1], but there has heretofore never been a comprehensive study comparing 
scaffold ranking results to positional scanning results across a large number of libraries. In 
particular, because screening all positional scanning libraries may not be practical in all 
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assays, determining the information both present and absent in a scaffold ranking, relative to 
positional scanning, is vital for proper usage of the scaffold ranking approach. 

Herein, we present such a study based on the screening of mixture-based libraries against the 
formylpeptide receptor (FPRl) and formylpeptide-likel receptor (FPR2) targets, two 
receptors that have been implicated in both cancer [12] and inflammatory responses [13]. 
Thirty-two positional scanning libraries (Figure 2 and Table SI) were tested in their entirety 
(for a total of 4,304 samples), along with the corresponding 32 scaffold ranking samples. 
Detailed methodologies and analyses of structures and activities of the individual 
compounds discovered in this campaign will be presented elsewhere [14,15]. In this study, 
we present and demonstrate quantitative tools that analyze and use the information present 
in a positional scanning library screening most important to increasing the likelihood of a 
successful deconvolution. We also focus on comparing and contrasting the scaffold ranking 
and positional scanning screening approaches from a mathematical modeling perspective. 
The results presented here demonstrate that the scaffold ranking Ubrary samples lead to 
effective selection of active positional scanning libraries; consequently, determining the 
relative activities of the libraries as the first step of a screening campaign does not require 
the use of the complete collection of positional scanning libraries. This strategy greatly 
reduces the time and resources required by testing a fraction of the samples with equivalent 
accuracy. However, it will be also shown that use of the complete collection of positional 
scanning libraries provides screening data that offers important information, beyond activity 
alone, which increases the likelihood of the successful deconvolution of a library. 

2. Results and Discussion 

2.1. Comparison of Scaffold Ranking and Positional Scanning Using the Harmonic Mean 

A positional scanning library is systematically arranged so that, at each position of diversity, 
every individual compound in the library appears in exactly one mixture in an approximately 
equimolar fashion. Because of this, an equimolar combination of all of the mixtures in one 
position of a positional scanning library will result in an equimolar mixture of all the 
compounds within that library, i.e., the scaffold ranking mixture associated with that library. 
As previously described, the Harmonic Mean Model accurately describes the activity of a 
mixture given the activity of its constituents in a simple independent binding assay, such as 
the data in this study [6]. Thus, for a positional scanning library the harmonic mean model 
would suggest that; 

N 

/C50, 



ScaffoldRankingMixture ^ — 



E 



/C50,: 



Here ICSOjj, is the IC50 of the mixture of the k'^^ position, with A' total functionalities, of 
a positional scanning library, and IC5Q ScaffoldRankingMixture is the /C50 of the scaffold 
ranking mixture associated with that library. 

In a high throughput screening (HTS) assay, it is not typical to have dose-response curves 
for all samples. Such was the case in this study, in which each positional scanning library 
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sample was tested for inhibition of a fluorescent ligand binding to either FPRl or FPR2 in 
duplicate at 10 |Jg/niL and averaged [14]. Each of the scaffold ranking samples was tested in 
duplicate at 10 Hg/mL and 5 |Jg/mL. Because the Harmonic Mean Model uses the ICjos of 
samples, it was necessary to extrapolate ICjos for samples using the equation: 



Here [x] is the concentration tested and is the percent inhibition value obtained at that 
concentration. Because %/„/, values close to zero (or negative) could result in arbitrarily high 
(or negative) x/C50 values, 1,000 (i.e., the value obtained when %;„/, f« 1% at [x] = 10) was 
chosen as an upper bound. Obviously, these values are extrapolations and not a substitute for 
actual experimentally determined IC5Qs, but they are sufficient for order-of-magnitude 
estimation. 

All averaged percent inhibition data from all positions of all the positional scanning libraries 
tested were converted to xICSOs and the harmonic mean was taken by position for both 
receptor targets. The four measured percent inhibitions for each scaffold ranking sample 
were converted to xICSQs and their average and standard error were calculated. The results 
are illustrated in Table 1 and Figure 3. The most immediate observation is that the most 
active library, 19, is detected equally well using either technique; when purely viewed as a 
method of determining the most potentially active scaffold, using the scaffold ranking 
libraries is equally effective to using the positional scanning libraries but requires testing of 
less than one percent of the samples (32 samples versus 4,304 samples). 

Considering the inherent inaccuracy of single-dose /C50 extrapolations one would not 
expect perfect correspondences between scaffold ranking xICSOs and the harmonic mean of 
a position's xICSQs. In general, however, scaffold ranking activities corresponded well to 
those obtained by harmonic meaning each position; only three comparisons resulted in even 
a four-fold disparity against the average harmonic mean of its positional scanning library, 
and 41 of the 64 total comparisons had under a two-fold disparity. Many differences were 
the result of the scaffold ranking xICSQ being 1,000, and the positional scanning harmonic 
means being lower; this is unsurprising, since by imposing a cap on xICSQ values, errors 
would necessarily be one-sided. The three largest deviations, however, were all 
overestimates: Libraries 5, 19, and 24 against FPRl. Library 19 had the highest error (over 
five-fold more active than the average harmonic mean of its positional scanning library) but 
was the most active library against FPRl in either case. Libraries 5 and 24, while showing 
above-average activity in their positional scanning samples, were not actually the second- 
and third- most active libraries against FPRl; library 32, whose scaffold ranking xIC5Q and 
positional scanning harmonic means corresponded quite well, was actually the second-most 
active overall. It should be noted, however, libraries 5 and 24 do not exhibit substantially 
less active positional scanning harmonic means than library 32. 



2.2. Analysis of Positional Scanning Profiles 

As shown above, scaffold ranking is equally capable of gauging the overall activity of a 
given library. However, when the assay throughput rate allows, there is a wealth of 
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additional information present in a full screening of all positional scanning libraries that can 
aid in choosing the most promising Ubraries to deconvolute. One of the most important 
aspects of a positional scanning activity profile is the level of activity differentiation of 
samples at each position. Given the same overall library activity, a positional scanning 
activity profile that shows few mixtures at each position that are much more active than the 
rest is likelier to have compounds that are more active than one with Uttle differentiation. To 
see why this is the case, consider a library with a scaffold ranking sample /C50 of 100 |jM 
containing inactive compounds with IC5Qs of 1,000 |jM and an unknown percentage of 
active compounds of fixed unknown activity. Under the assumptions of the Harmonic Mean 
model, such a library could theoretically have a composition of compounds ranging from 
100% of compounds with IC5Qs of 100 [oM each, to 0.01% of compounds with an /C50 of 
1 1 nM each, to even smaller percentages of even more active compounds. If, in such a 
library, a position contained only one mixture that exhibited activity higher than that of an 
inactive compound (therefore being a well-differentiated profile), then that mixture would 
need to have a very high relative activity (so that the harmonic mean of that position would 
come out to 100 |jM), and thus the vast majority of the active compounds would be 
mathematically required to be within that mixture. Since that mixture represents only a 
fraction of the total library, this in turn puts an upper bound on the percentage of active 
compounds that could be in the library; as presented above, the lower the percentage of 
active compounds, the greater the required activity of each active compound. In contrast, if a 
position contained mixtures all with approximately the same activity, then these mixtures' 
IC5Qs must be approximately 100 |jM each in order for their harmonic mean to be 100 |jM. 
Thus each mixture would be required to have approximately the same number of active 
compounds, and so no upper bound can be placed on the overall percentage of active 
compounds. 

In an effort to quantify the activity profile of a positional scanning library position that 
models activity differentiation, the following procedure was developed. For a given position 

with n functional groups, let {x^.}^_^ be the rank-ordered activities of the mixtures in that 
position, so that is the most active mixture's activity, X2 is the second-most active 
mixture's activity, etc. In this study, percentage inhibition values were used for the 
activities; since we are attempting to compare the differentiation of positional scanning 
profiles within a single study, absolute scaling issues are irrelevant so long as they are 
consistent, and so long as higher numbers correspond to greater activity. Next, the maximum 
drop in activity: 

m= max (xj. - Xfc+j) 

fc=l,...,n— 1 

was calculated. This represents the maximum sequential activity difference within the 
position; clearly, the more difference between active and inactive mixtures, the greater m. 
The value of k for which the largest drop occurs, K, is calculated as well: 

K= argmax (xk - Xk+i) 

fe=l,...,n-l 
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For an ideally differentiated positional scanning library activity profile, then, one would see 
high activity differences between active and inactive mixtures (i.e., a high value of m) in a 
relatively small number of mixtures (i.e., a low value of K). To this end, the index of 
differentiation of a positional scanning position's profile is defined as: 

m 

^ DIFF — 2K — I 



The values of Ioiff for each position of each of the 32 libraries in this study are shown in 
Table 2. Selected profiles illustrating high and low differentiation are shown in Figure 4. 
Note that Ioiff can vary greatly from position to position in a given library; this is 
unsurprising, since specific functionalities at certain positions will inevitably be more 
important to the activity potential of a compound than others. Library 32 exhibited by far the 
highest average index of differentiation for FPRl, having the highest single position Idiff^ 
and two remaining positions ranking 6th and 1 1th. Libraries 20, 21, and 24 showed 
relatively high differentiation in some positions, but not all, and had the next highest average 
Idiff- For FPR2, library 19 had the highest average Idiff^ followed closely by libraries 20 
and 29; all three exhibited high-ranking differentiation in two of their positions. 

As reasoned above, high differentiation is very important for potentiating the discovery of 
highly active compounds in a positional scanning screening profile. Such differentiation in 
the absence of overall activity, however, may only result in varying degrees of inactive 
compounds. Therefore, the overall potentiation index of deconvolutability of a library is 
better quantified as: 

^ _ Average I^j^p Over All Positions 

DECON Average Harmonic Mean of xICBOs Over All Positions ""^^ 

The values of Idecon for each library are in Table 2 and graphed in Figure 5. As is evident, 
each receptor has one standout library: library 32 for FPRl, because of high relative activity 
and very high relative differentiation, and library 19 for FPR2 (which had the second highest 
score in FPRl as well), because of very high relative activity and high relative 
differentiation. Indeed, these libraries were the two chosen in this study for deconvolution, 
and both proved to lead to the identification of highly active individual compounds with 
nanomolar Kj values [14]. 

2.3. Selectivity in Scaffold Ranking and Positional Scanning 

In the event that selectivity is a desirable endpoint in a study, as it was in this study, 
additional important lessons can be learned about the relative utility of screening scaffold 
ranking libraries versus complete positional scanning libraries. As has already been noted, 
library 19 showed the highest level of overall scaffold ranking activity in both receptors. 
Library 32, in contrast, only showed substantial scaffold ranking activity against the FPRl 
target. Using this information to infer that library 19 could not include selective compounds, 
however, would not be an appropriate use of the activity of the scaffold ranking samples. 
The absence of activity in FPR2 for library 32 did indeed imply, both in its positional 
scanning profile and its eventual deconvolution, an absence of FPR2-active individual 



Molecules. Author manuscript; available in PMC 2014 July 22. 



Santos et al. 



Page 7 



compounds. The reverse, however, proved not to be true, as is evident from a closer 
inspection of library 19's positional scanning activity profile (Figure 6). Although Ubrary 19 
exhibits overall high activity against both targets, the mixtures at each position that exhibit 
that activity vary greatly; FPR2 shows greater differentiation in the first two positions (as 
evidenced by its higher index of differentiation as described above), and the mixtures of 
maximum activity do not correspond to those of FPRl. These patterns persisted when 
individual compounds were tested. Thus, positional scanning libraries should be selected 
and screened even if the scaffold ranking screening does not show the desired selectivity. 
Positional scanning libraries offer a window into the possibility of additional selectivity of 
individual compounds that would not be evident in the analysis of the scaffold ranking 
library's activity alone. 

3. Conclusions 

In the past, scaffold ranking has been used as a first step for determining which library will 
be tested using positional scanning. With the side-by-side data presented in this study, we 
have shown for the first time that scaffold ranking is indeed sufficient for accurately 
demonstrating the overall activity of a library, with each library presenting essentially the 
same activity levels in its scaffold ranking format as in its full positional scanning format. 
However, we have also demonstrated that, when feasible, complete screening of all 
positional scanning libraries allows for additional analyses of the differentiation and 
selectivity that can drastically increase the likelihood of a successful deconvolution. If only 
the scaffold ranking samples had been tested, library 19 surely would have been chosen, 
based on the basis of its activity, to screen the complete positional scanning library; as we 
have shown in this study, to exclude a library on the grounds of selectivity using only 
scaffold ranking information is a mistake. The potential of identifying selective compounds 
is only revealed from analysis of its positional scanning profile. As will be presented in a 
complementary study, 106 individual compounds were synthesized and tested from library 
19 [14]. Nineteen compounds had Ki values < 100 nM for FPRl, of which 15 were FPRl 
selective (Ki values for FPR2 are more than 100-fold greater); 23 compounds had Ki values 
< 100 nM for FPR2, of which 12 were selective for FPR2. Furthermore, Library 32, with 
less activity exhibited in the scaffold ranking than other libraries, may not have been 
explored at all, had its impressively differentiated profile not been determined through 
screening its positional scanning library. Deconvolution of library 32 resulted in the 
synthesis of only eight individual compounds, of which fom- had Ki values < 20 nM in FPRl 
and were highly selective. Additional libraries (library 24 for FPRl, and libraries 20 and 29 
for FPR2) that have not yet been deconvoluted show about the same indices of 
deconvolutability as the successfully deconvoluted library 19 for FPRl; these are clearly a 
possible direction for future research. By having the scaffold ranking data in tandem with 
the positional scanning data, one is better able to see the strengths and weaknesses of each 
approach, and use this knowledge to further increase the effectiveness of akeady-effective 
mixture-based combinatorial library screening. 
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(C) Individual Active Compound 




Figure 1. 

A schematic tracing an active compound (black) through the combinatorial library screening 
process, from scaffold ranking (A) to positional scanning (B) to individual compound 
synthesis (C). 
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Figure 2. 

Thirty two small-molecule libraries tested against FPRl and FPR2. 
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Figure 3. 

Comparison of the extrapolated scaffold ranking /C50 of each library (SR, shown as red 
stars), and the harmonic means of the extrapolated IC50s of each position of the positional 
scanning libraries samples (PI, P2, P3 and P4, shown as blue circles). 
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Library 32, Position 2, FPRl, Idiff=57.70 
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Figure 4. 

Examples of very high differentiation (Library 32, Position 2, for FPRl) and little 
differentiation (Library 19, Position 2, for FPRl) in positional scanning profiles, as defined 
in Equation (5). Note that overall. Library 19 exhibits more activity, but Library 32 is clearly 
more well-differentiated. Additional zero percent inhibition values have been removed from 
Library 32' s profile for clarity. 
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Figure 5. 

Indices of Deconvolutability for each library, as defined in Equation (6), against both 
targets. 
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Library 19, Position 1 



70.0 

60.0 

50.0 

J 40.0 
I 

S 30.0 
20X> 

10.0 
0.0 



















■ 




ll jJl 



IFPRI 
IFPR2 



70.0 



60.0 



50.0 



40.0 



0.0 



Library 19, Position 2 















klllJ^jiuUl 




i 







IFPt?l 



30.0 
2iO 

J 20.0 
= 15,0 
10.0 
5.0 
0.0 



Library 19, Position 3 





























1 1 


— 













■ FPRl 
IFPR2 



Library 19, Position 4 




I FPRl 
I FPR2 



Figure 6. 
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The full positional scanning profile of Library 19. Notice that there are many instances of 
different mixtures among the most active at the FPRl target not being active at the FPR2 
target, and vice versa. This indicates the potential selectivity that was eventually found. 
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